WHAT IS CLAIMED IS: 



1. For use in a wide-issue pipelined processor, a mechanism 

2 for reducing pipeline stalls between nested calls, comprising: 

3 a program counter (PC) generator that generates return PC 

4 values for call instructions in a pipeline of said processor; and 

5 return PC storage, coupled to said PC generator and located in 

6 an execution core of said processor, that stores said return PC 

7 values and makes ones of said return PC values available to a PC of 

%i said processor upon execution of corresponding return instructions. 

PI 

P4 2. The mechanism as recited in Claim 1 wherein said PC 

2*^; generator is associated with an instruction issue unit of said 
3; processor. 

3- The mechanism as recited in Claim 1 wherein said PC 
^ generator generates each of said return PC values in a single clock 
3 cycle. 

4. The mechanism as recited in Claim 1 wherein a return PC 

2 queue of said return PC storage has at least as many slots as a 

3 number of call instructions a fetch/decode stage of said pipeline 

4 can decode prior to grouping. 
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5. The mechanism as recited in Claim 1 wherein said return 
PC values move through registers of said return PC storage as 
corresponding ones of said return instructions move through stages 
in said pipeline - 

6. The mechanism as recited in Claim 1 wherein said return 
PC storage makes said ones of said return PC values available to a 
PC of said processor as said corresponding return instructions are 
in an execution stage of said pipeline. 

7. The mechanism as recited in Claim 1 wherein said call 
instruction is executed in a fetch/decode stage of said pipeline. 

8 . The mechanism as recited in Claim 1 wherein said 
processor is a digital signal processor. 
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9. For use in a wide-issue pipelined processor, a method of 

2 reducing pipeline stalls between nested calls, comprising: 

3 generating return PC values for call instructions in a 

4 pipeline of said processor; 

5 storing said return PC values in return PC storage located in 

6 an execution core of said processor; and 

7 making ones of said return PC values available to a PC of said 

8 processor upon execution of corresponding return instructions. 

10. The method as recited in Claim 9 wherein said generating 
2J is carried out in an instruction issue unit of said processor. 

m 

ff 11- The method as recited in Claim 9 wherein said generating 

comprises generating each of said return PC values in a single 
3f7 clock cycle. 

lU 

^ 12. The method as recited in Claim 9 wherein a return PC 

2 queue of said return PC storage has at least as many slots as a 

3 number of call instructions a fetch/decode stage of said pipeline 

4 can decode prior to grouping. 

13. The method as recited in Claim 9 further comprising 

2 moving said return PC values through registers of said return PC 

3 storage as corresponding ones of said return instructions move 
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through stages in said pipeline, 

14. The method as recited in Claim 9 wherein said return PC 
storage makes said ones of said return PC values available to a PC 
of said processor as said corresponding return instructions are in 
an execution stage of said pipeline. 

15. The method as recited in Claim 9 further comprising 
executing said call instruction in a fetch/decode stage of said 
pipeline • 

16. The method as recited in Claim 9 wherein said processor 
is a digital signal processor. 
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17. A digital signal processor, comprising: 

2 a pipeline having stages capable of executing call 

3 instructions; 

4 a wide-issue instruction issue unit; 

5 a program counter (PC) generator that generates return PC 

6 values for call instructions in a pipeline of said processor; and 

7 return PC storage, coupled to said PC generator and located in 

8 an execution core of said processor, that stores said return PC 

9 values and makes ones of said return PC values available to a PC of 
10 said processor upon execution of corresponding return instructions. 

bz 18. The DSP as recited in Claim 17 wherein said PC generator 

^^f' is associated with an instruction issue unit of said DSP. 

fT 19. The DSP as recited in Claim 17 wherein said PC generator 

i'l generates each of said return PC values in a single clock cycle. 

20. The DSP as recited in Claim 17 wherein a return PC queue 

2 of said return PC storage has at least as many slots as a number of 

3 call instructions a fetch/decode stage of said pipeline can decode 

4 prior to grouping. 

21. The DSP as recited in Claim 17 wherein said return PC 
2 values move through registers of said return PC storage as 
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3 corresponding ones of said return instructions move through stages 

4 in said pipeline. 

22. The DSP as recited in Claim 17 wherein said return PC 

2 storage makes said ones of said return PC values available to a PC 

3 of said DSP as said corresponding return instructions are in an 

4 execution stage of said pipeline. 

23. The DSP as recited in Claim 17 wherein said call 
^ instruction is executed in a fetch/decode stage of said pipeline. 
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